Mining Constraint-based Multidimensional Frequent Sequential Pattern in Web Logs

نویسندگان

  • S. Vijayalakshmi
  • S. Suresh Raja
چکیده

In this paper we introduce an efficient strategy for discovering Web usage mining is the application of data mining techniques to discover usage patterns from Web data, in order to understand and better serve the needs of Web-based applications. Web usage mining consists of three phases, namely preprocessing, pattern discovery, and pattern analysis. This paper describes each of these phases in detail.The main objective of multidimensional sequential pattern mining is to provide the end user with more useful and interesting patterns. To mine such kind of sequence data, we have used an extended version of the prefixspan(EXT-Prefixspan) algorithm to extract the Constraint-based multidimensional frequent sequential patterns in web usage mining. A web access pattern is a sequential pattern that is pursued frequently by users. Using these sequences as prefixes a projected database is constructed which is then recursively mined to find the frequent sequential patterns. The EXT-Prefixspan mines the complete set of patterns but greatly reduces the efforts of candidate subsequence generation. Moreover, prefix –projection substantially reduces the size of projected database and leads to efficient processing. We show that the EXT-Prefixspan algorithm is more flexible at capturing desired knowledge than previous Algorithm.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Knowledge Discovery from Web Usage Data: Research and Development of Web Access Pattern Tree Based Sequential Pattern Mining Techniques: A Survey

Sequential pattern mining is the process of applying data mining techniques to a sequential database, to extract frequent subsequences to discover correlation that exists among the ordered list of events. Web Usage mining (WUM) discovers and extracts interesting knowledge/patterns from Web logs is one of the applications of Sequential Pattern Mining. In this paper, we present a survey of the se...

متن کامل

Performance Improvements and Efficient Approach for Mining Periodic Sequential Access Patterns

Surfing the Web has become an important daily activity for many users. Discovering and understanding web users’ surfing behavior are essential for the development of successful web monitoring and recommendation systems. To capture users’ web access behavior, one promising approach is web usage mining which discovers interesting and frequent user access patterns from web usage logs. Web usage mi...

متن کامل

Web Access Pattern Algorithms in Education Domain

Sequential pattern mining discovers frequent user access patterns from web logs. Apriori-like sequential pattern mining techniques requires expensive multiple scans of database. So, now days, WAP (Web Access Pattern) tree based algorithm is used. It is faster than traditional techniques. However, the use of conditional search strategies in WAP-tree based mining algorithms requires re-constructi...

متن کامل

Multidimensional Web Access Pattern Tree (MD-WAP Tree)

Mining frequent web access patterns from large data (web log) is one significant application of sequential pattern mining. Web access patterns are set of frequent sub sequences that are useful to know user behaviour in real time in order to make dynamic decisions. Techniques for extracting web access patterns from data available in two flavours: apriori based and non apriori based (tree based)....

متن کامل

Mining Frequent Sequential Patterns under a Similarity Constraint

Many practical applications are related to frequent sequential pattern mining, ranging from Web Usage Mining to Bioinformatics. To ensure an appropriate extraction cost for useful mining tasks, a key issue is to push the user-defined constraints deep inside the mining algorithms. In this paper, we study the search for frequent sequential patterns that are also similar to an user-defined referen...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2009